Extreme Value Theory Based Text Binarization In Documents and Natural Scenes
نویسندگان
چکیده
This paper presents a novel image binarization method that can deal with degradations such as shadows, nonuniform illumination, low-contrast, large signal-dependent noise, smear and strain. A pre-processing procedure based on morphological operations is first applied to suppress light/dark structures connected to image border. A novel binarization concept based on difference of gamma functions is presented. Next Generalized Extreme Value Distribution (GEVD) is used to find proper threshold for binarization with a significance level. Proposed method emphasizes on region of interest (with the help of morphological operations) and generates less noisy artifacts (due to GEVD). It is much simpler than other methods and works better on degraded documents and natural scene images. Keywords-Generalized extreme value distribution; Geodesic transform morphological reconstruction; Connected opening; Text binarization
منابع مشابه
Detecting Text in Natural Scenes Based on a Reduction of Photometric Effects: Problem of Text Detection
In this paper, we propose a novel method for detecting and segmenting text layers in complex images. This method is robust against degradations such as shadows, non-uniform illumination, low-contrast, large signaldependent noise, smear and strain. The proposed method first uses a geodesic transform based on a morphological reconstruction technique to remove dark/light structures connected to th...
متن کاملDetecting Text in Natural Scenes Based on a Reduction of Photometric Effects: Problem of Color Invariance
In this paper, we propose a novel method for detecting and segmenting text layers in complex images. This method is robust against degradations such as shadows, non-uniform illumination, low-contrast, large signaldependent noise, smear and strain. The proposed method first uses a geodesic transform based on a morphological reconstruction technique to remove dark/light structures connected to th...
متن کاملAn Analysis of Image Binarization Techniques for Natural Scene Images
Text extraction from natural scene images is an emerging field in computer graphics. Extracted text contains important information that can be used for various purpose like vehicle number plate detection to identify the vehicle, to provide information of surrounding to visually impaired persons, preservation of information of historical documents etc. Binarization is a key process in text extra...
متن کاملFont and Background Color Independent Text Binarization
We propose a novel method for binarization of color documents whereby the foreground text is output as black and the background as white regardless of the polarity of foreground-background shades. The method employs an edge-based connected component approach and automatically determines a threshold for each component. It has several advantages over existing binarization methods. Firstly, it can...
متن کاملTowards Text Recognition in Natural Scene Images
In this paper, we propose a novel methodology for text detection in natural scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully processes natural scene images having shadows, non-uniform illumination, low contrast and large signaldependent noise. Conn...
متن کامل